Enumerative encoding/decoding of variable-to-fixed-length codes for memoryless sources

Authors

  • Yuriy A. Reznik
  • Anatoly V. Anisimov
Abstract

We offer novel algorithms for efficient encoding/decoding of variable-to-fixed-length (VF) codes, requiring at most a quadratic amount of space: $O(L^2)$, where $L$ is the depth of a coding tree. This is a major improvement over the exponential $O(2^L)$ space usage of conventional techniques, which keep complete representations of coding trees in the computer's memory. These savings are achieved by utilizing algebraic properties of VF coding trees constructed by the Tunstall or Khodak algorithms, and by employing combinatorial enumeration techniques for encoding/decoding of codewords. The encoding/decoding complexity of our algorithms is linear in the number of symbols they process. As a side product, we also derive an exact formula for the average redundancy of such codes under memoryless sources, and show its usefulness for the analysis and design of codes with a small number of codewords.

1 Definitions

Consider a memoryless source $S$ producing symbols from an input alphabet $A = \{a_1, \ldots, a_m\}$ ($2 \le m < \infty$) with probabilities $\{p_i = P(a_i),\ i = 1, \ldots, m\}$. By $p_{\min}$ and $p_{\max}$ we denote the probabilities of the least- and the most-likely symbols, respectively, and assume that $0 < p_{\min} \le p_{\max} < 1$.

Assume that a message $\Sigma = \{a_{i_k}\}_{k=1}^{\infty}$ is an infinite-length sequence of symbols produced by the source $S$. The main idea of variable-length coding is to define a set of words $X = \{x_j \in A^*,\ j = 1, \ldots, M\}$ ($m \le M < \infty$), such that any message $\Sigma$ can be uniquely represented by a sequence of words from $X$, and then map these words $x_j$ into codewords $\varphi(x_j)$ formed from letters of an output alphabet $B = \{b_1, \ldots, b_n\}$ ($2 \le n < \infty$):

$$\{a_{i_k}\}_{k=1}^{\infty} = \{x_{j_r}\}_{r=1}^{\infty} \rightarrow \{\varphi(x_{j_r})\}_{r=1}^{\infty} = \{b_{i_s}\}_{s=1}^{\infty}.$$

We further assume that the mapping $\varphi$ is injective, and that the output code $\{\varphi(x_j)\}$ is uniquely decodable [4].

In this paper we only consider coding systems producing codewords $\varphi(x_j)$ of the same length. For example, we can simply pick $|\varphi(x_j)| = \lceil \log_n M \rceil$ ($1 \le j \le M$), and use the indices of the words $x_j$ to produce their codes.
Such coding systems are called variable-length-to-block (VB) or variable-to-fixed-length (VF) codes.
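
To make the definitions above concrete, the following is a minimal sketch of a conventional dictionary-based VF coder. It is not the enumerative algorithm of this paper: it stores every word of X explicitly, which is the space cost the abstract sets out to avoid. The sketch assumes a binary output alphabet (n = 2), single-character source symbols, and the textbook Tunstall construction (repeatedly splitting the most probable leaf). The function names tunstall_words, codeword_length, vf_encode and vf_decode are illustrative and do not come from the paper.

```python
import heapq


def tunstall_words(probs, max_words):
    """Return a sorted Tunstall word set X with at most max_words words.

    probs maps each single-character source symbol to its probability.
    If max_words is smaller than the alphabet size, the alphabet itself
    is returned.
    """
    m = len(probs)
    # heapq is a min-heap, so probabilities are stored negated.
    heap = [(-p, sym) for sym, p in probs.items()]
    heapq.heapify(heap)
    # Splitting a leaf removes one word and adds m, a net gain of m - 1.
    while len(heap) + m - 1 <= max_words:
        neg_p, word = heapq.heappop(heap)  # most probable leaf
        for sym, p in probs.items():
            heapq.heappush(heap, (neg_p * p, word + sym))
    return sorted(word for _, word in heap)


def codeword_length(num_words, n=2):
    """Smallest integer length with n**length >= num_words (ceil(log_n M))."""
    length, capacity = 0, 1
    while capacity < num_words:
        capacity *= n
        length += 1
    return length


def vf_encode(message, words):
    """Parse message into words of X and emit fixed-length binary indices.

    Returns (codewords, leftover), where leftover is a trailing prefix
    that does not complete a word.
    """
    index = {w: i for i, w in enumerate(words)}
    length = codeword_length(len(words))
    codes, current = [], ""
    for sym in message:
        current += sym
        # The words are leaves of a complete tree, so no word is a prefix
        # of another and the first match is the only possible match.
        if current in index:
            codes.append(format(index[current], "0{}b".format(length)))
            current = ""
    return codes, current


def vf_decode(codes, words):
    """Map each fixed-length binary index back to its word."""
    return "".join(words[int(code, 2)] for code in codes)


if __name__ == "__main__":
    words = tunstall_words({"a": 0.7, "b": 0.3}, max_words=4)
    print(words)                      # ['aaa', 'aab', 'ab', 'b']
    codes, leftover = vf_encode("aabbaaaab", words)
    print(codes, repr(leftover))      # ['01', '11', '00', '10'] ''
    print(vf_decode(codes, words))    # aabbaaaab
```

With M = 4 words and n = 2, every codeword has length 2, matching the choice of codeword length described above. The dictionary grows with the number of leaves of the coding tree, which is why tree-based implementations of this kind become expensive for deep trees.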

Similar resources

Exponential bounds on error probability with Feedback

Feedback is useful in memoryless channels for decreasing complexity and increasing reliability; the capacity of memoryless channels, however, cannot be increased by feedback. For fixed-length block codes, even the decay rate of error probability with block length does not increase with feedback for most channel models. Consequently for making the physical layer more reliable for higher laye...

Variable-to-Fixed Length Codes and Plurally Parsable Dictionaries

A variable-to-fixed length encoding procedure is a mapping from a dictionary of variable length strings of source outputs to the set of codewords of a given length. For memoryless sources, the Tunstall procedure can be applied to construct optimal uniquely parsable dictionaries and the resulting codes are known to work especially well for sources with small entropies. We introduce the idea of plur...

Universal variable-to-fixed length source codes

A universal variable-to-fixed length algorithm for binary memoryless sources which converges to the entropy of the source at the optimal rate is known. We study the problem of universal variable-to-fixed length coding for the class of Markov sources with finite alphabets. We give an upper bound on the performance of the code for large dictionary sizes and show that the code is optimal in the se...

Source coding exponents for zero-delay coding with finite memory

Fundamental limits on the source coding exponents (or large deviations performance) of zero-delay finite-memory (ZDFM) lossy source codes are studied. Our main results are the following. For any memoryless source, a suitably designed encoder that time-shares (at most two) memoryless scalar quantizers is as good as any time-varying fixed-rate ZDFM code, in that it can achieve the fastest exponen...

Efficient universal noiseless source codes

Although the existence of universal noiseless variable-rate codes for the class of discrete stationary ergodic sources has previously been established, very few practical universal encoding methods are available. Efficient implementable universal source coding techniques are discussed in this paper. Results are presented on source codes for which a small value of the maximum redundancy is achie...


Publication date: 2007